Basic Statistics

Raw Counts

Name                    Value
Rows                    9,994
Columns                 26
Discrete columns        16
Continuous columns      10
All missing columns     0
Missing observations    0
Complete Rows           9,994
Total observations      259,844
Memory allocation       2.7 Mb
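
The raw counts are tied together by simple arithmetic, which the short sketch below checks. It assumes that "Total observations" means the number of cells (rows times columns) and that every column is classified as either discrete or continuous. The variable names are illustrative and are not produced by the reporting tool.

```python
# Figures copied from the Raw Counts table.
rows = 9_994
columns = 26
discrete_columns = 16
continuous_columns = 10
missing_observations = 0

# Assumption: every column is counted as either discrete or continuous.
assert discrete_columns + continuous_columns == columns

# Assumption: total observations is the number of cells, rows x columns.
total_observations = rows * columns
print(total_observations)  # 259844

# Share of cells that are missing (0.0 here, since nothing is missing).
missing_share = missing_observations / total_observations
print(missing_share)  # 0.0
```

With no missing cells, every row is complete, which matches the "Complete Rows" figure of 9,994.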

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 9 columns ignored with more than 50 categories.
## order_id: 5009 categories
## order_date: 1237 categories
## ship_date: 1334 categories
## customer_id: 793 categories
## customer_name: 793 categories
## city: 531 categories
## postal_code: 631 categories
## product_id: 1862 categories
## product_name: 1850 categories
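
The message above comes from a cardinality filter: discrete columns with too many distinct values are left out of the plot. The sketch below reproduces that rule using only the category counts printed in this report. The function name `ignored_columns` is hypothetical, and the dictionary covers only the columns the report lists, not all 16 discrete columns. It also shows why the correlation section further down, which reports a threshold of 20 instead of 50, ignores one more column.

```python
# Category counts as printed in the messages of this report.
category_counts = {
    "order_id": 5009,
    "order_date": 1237,
    "ship_date": 1334,
    "customer_id": 793,
    "customer_name": 793,
    "city": 531,
    "state": 49,
    "postal_code": 631,
    "product_id": 1862,
    "product_name": 1850,
}


def ignored_columns(counts, max_categories):
    """Return the columns whose category count exceeds max_categories."""
    return [name for name, n in counts.items() if n > max_categories]


# Threshold reported for the bar charts (and for the PCA section).
bar_chart_ignored = ignored_columns(category_counts, 50)
# Threshold reported for the correlation analysis.
correlation_ignored = ignored_columns(category_counts, 20)

print(len(bar_chart_ignored))    # 9
print(len(correlation_ignored))  # 10
print(set(correlation_ignored) - set(bar_chart_ignored))  # {'state'}
```

The only difference between the two lists is `state`: with 49 categories it passes the 50-category limit but not the 20-category one.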

QQ Plot

Correlation Analysis

## 10 features with more than 20 categories ignored!
## order_id: 5009 categories
## order_date: 1237 categories
## ship_date: 1334 categories
## customer_id: 793 categories
## customer_name: 793 categories
## city: 531 categories
## state: 49 categories
## postal_code: 631 categories
## product_id: 1862 categories
## product_name: 1850 categories
## Warning in cor(x = structure(list(row_id = c(1, 2, 3, 4, 5, 6, 7, 8, 9, : the standard deviation is zero
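
The warning above means that at least one column passed to the correlation has no spread. Pearson correlation divides by the product of the two standard deviations, so a constant column leaves the result undefined (R's `cor` reports `NA` for those entries). The sketch below is a plain-Python illustration of that arithmetic, not the code the report ran. The `pearson` function and both sample vectors are hypothetical. A likely culprit in this data set is a constant indicator such as the `country_United.States` feature named in the PCA warning, but that is an assumption.

```python
import math


def pearson(x, y):
    """Pearson correlation; returns NaN when either input has zero spread."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations (the 1/(n-1) factors cancel).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    denom = math.sqrt(ss_x * ss_y)
    if denom == 0:
        # A constant column has zero standard deviation: correlation is undefined.
        return math.nan
    return cov / denom


constant = [1, 1, 1, 1]           # hypothetical indicator equal to 1 on every row
varying = [2.0, 4.0, 6.0, 8.0]    # hypothetical numeric column

print(pearson(constant, varying))               # nan
print(pearson(varying, [1.0, 2.0, 3.0, 4.0]))   # 1.0
```

Dropping constant columns before computing the correlation matrix avoids the warning without changing any of the remaining coefficients.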

Principal Component Analysis

## 9 features with more than 50 categories ignored!
## order_id: 5009 categories
## order_date: 1237 categories
## ship_date: 1334 categories
## customer_id: 793 categories
## customer_name: 793 categories
## city: 531 categories
## postal_code: 631 categories
## product_id: 1862 categories
## product_name: 1850 categories
## Warning in plot_prcomp(data = structure(list(row_id = c(1, 2, 3, 4, 5, 6, : The following features are dropped due to zero variance:
##  * country_United.States